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Abstract. We consider the embedding problem in coding theory: given an 
independence (a code-related property) and an independent language L , find 
a maximal independent language containing L. We consider the case where 
the code-related property is defined via a rational binary relation that is de¬ 
creasing with respect to any fixed total order on the set of words. Our method 
works by iterating a max-nrin operator that has been used before for the embed¬ 
ding problem for properties defined by length-increasing-and-transitive binary 
relations. By going to order-decreasing rational relations, represented by input- 
decreasing transducers, we are able to include many known properties from both 
the noiseless and noisy domains of coding theory, as well as any combination 
of such properties. Moreover, in many cases the desired maximal embedding is 
effectively computable. 
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1 Introduction 

The embedding problem for a language L satisfying a property V is to find 
a language L' that contains L and is maximal satisfying V . This problem is 
meaningful when the property V is an independence. In particular, many nat¬ 
ural code-related properties are independences with respect to binary relations 
on words. In this setting, a binary relation p defines the property that consists 
of all languages in which no two different words are related via p. Such lan¬ 
guages are called p-independent. The embedding problem has been addressed 
well for properties defined by length-increasing-and-transitive relations [24], as 
well as for several fixed properties like the bifix code property [28], the solid 
code property [14], and the bounded deciphering delay property [3]. In [6], the 
authors consider properties where the relation p is rational and, therefore, de¬ 
scribed by a finite transducer t. In this setting, assuming the given language L 
is regular, one can decide whether L is a maximal t-independent language. The 
contributions of the present paper are as follows. 

• We introduce the concept of input-decreasing transducer, which realizes 
order-decreasing relations, as a tool for defining many natural code-related 
properties, including variable-length code properties and error-detection 
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properties, as well as any combinations of those. Assuming a fixed, but 
arbitrary total order on words, an input-decreasing transducer t is such 
that, for any input word w, all output words of t have a (strictly) smaller 
order than w. 

• We show that starting with any t-independent language L , we can embed 
L into a maximal t-independent language /x^L, by iterating the max- 
min operator fi t on L. The non-iterated operator /it is considered in 
[24] where it is shown that if t is length-decreasing and transitive, then 
any t-independent language L is embedded into fitL which is maximal 
t-independent. In many cases, /i* converges after finitely many steps. We 
also show a natural example of a t where fi^ does not converge after 
finitely many steps. 

• Our embedding results hold for any fixed, but arbitrary, language M rel¬ 
ative to which maximality is considered, that is, we embed any L C M 
into a maximal t-independent subset of M -this idea of relative maximal¬ 
ity has been considered before, e.g., in [4,6,20]. When M is finite, fit 
always converges after i iterations, for some i, to fi\L. When both M and 
L are regular and fit converges after finitely many operations, then /xJL 
is computable. With our approach we provide a solution to the embed¬ 
ding problem for many classical cases of both variable-length codes (for 
M = all possible words) and error-detecting codes for substitution and for 
synchronization types of errors (for M = all words of a certain length). 

The paper is organized as follows. The next section contains information about 
the basic notation and terminology used in the paper, and Section 3 provides 
some background information on independent languages and maximal embed¬ 
dings, and introduces the iterated max-min operator. Section 4 contains a 
few technical results and the weak condition of a transducer being smooth, 
which guarantees that when the max-min operator converges in finitely many 
iterations, then it produces a maximal embedding. Section 5 focuses on input- 
decreasing transducers, which are always smooth and guarantee that the iterated 
max-min operator produces a maximal embedding. Section 6 demonstrates with 
several examples that the concept of input-decreasing transducer can be used 
to define many known properties from both the noiseless and noisy domains of 
coding theory. In that section we also show an example of an input-decreasing 
transducer for which the iterated operator does not converge finitely. Finally, 
the last section contains a few concluding remarks and directions for future 
research. 


2 Basic notions and notation 

In this section we present our notation and terminology about words, languages, 
transducers and word operators. 
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We write N, No for the sets of natural numbers (not including 0) and non¬ 
negative integers, respectively. If S is a set, then |Sj denotes the cardinality of 
S , and 2 s denotes the set of all subsets of S. An alphabet is a finite nonempty 
set of symbols. In this paper, we write E for any arbitrary alphabet. The set 
of all words, or strings, over E is written as E* and includes the empty word 
A. A language (over E) is any set of words. In the rest of this paragraph, we 
use the following arbitrary object names: i,j for nonnegative integers, /\, L for 
languages and u, v, w, x, y for words. If w £ L then we say that w is an L-word. 
When there is no risk of confusion, we write a singleton language {w} simply as 

w. For example, LUw and vLlw mean LU{w} and {r>}U{u>}, respectively. We 
use standard operations and notation on words and languages [18,21,25]. For 
example, |w|, uv. w l , KL, L l , L*, L + denote respectively, the length of w, the 
concatenation of u and v, the word consisting of i copies of w, the concatenation 
of K and L, the language consisting of all words obtained by concatenating any 
i L- words, the Kleene star of L , and L + = L* \ A. If w is of the form uv then u 
is a prefix and v is a suffix of w. If w is of the form uxv then x is an infix 
of w. If u ^ w then u is called a proper prefix of w —the definitions of proper 
suffix and proper infix are similar. 

Transducers and (word) relations [1, 22, 26]. A (word) relation over E 
is a subset of E* x E*, that is, a set of pairs (x,y) of words over the alphabet. 
The inverse of a relation p, denoted by p , is the relation {(y,x) | (, x , y) £ p}. 
The relation is transitive if (x,y), ( y,z ) £ p implies (x,z) £ p, for all words 

x, y,z\ that is, p o p C p, where ‘o’ denotes composition. Following [24], the 
relation is called length-increasing (resp. length-decreasing) if (x,y) £ p 
implies |a:| < \y\ (resp. |x| > \y\). 

A (finite) transducer is a quintuple t = (Q,E,T, I,F) such that Q is the set 
of states, /, F C Q are the sets of initial and final states, respectively, E is the 
alphabet and T C Q x E* x E* x Q is the finite set of transitions. Note that, in 
general transducers, one considers an input and an output alphabet, but in this 
paper the input and output alphabets are the same. The relation realized 
by the transducer t, denoted by R(t), is the set of labels in all the accepting 
paths of t. We write t(*) for the set of possible outputs of t on input x, 
that is, y £ t(x) iff (x,y) £ R(t). This notation is extended naturally to any 
language X: 

t( X ) = U 

The inverse of a transducer t, denoted by t , is the transducer that results 
from t by simply switching the input and output parts of the labels in the 
transitions of t. It follows that t _1 realizes the inverse of the relation realized 
by t. If t and s are transducers, then there are (effectively) a transducer (t V s) 
realizing R(t)UR(s) and a transducer (tos) realizing R(t)oR(s). By composing a 
transducer with itself i times, for i £ N, we obtain a transducer which we denote 
by t*. We define 

4 (t- 1 ) 1 . 
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Remark 1. For all t £ N, we have that 

R(t _i ) = (R(t i )) _1 - 

Indeed, note that (y,x) £ R(t _I ) <=>• “there are words x%,..., Xi-% such that 
(y, X \) £ R(t~ 1 ), (xi, X 2 ) £ R(t _1 ), ..., (xi-i,x) £ R(t -1 )” “there are words 
xi,... ,Xi -1 such that (x,xt- 1 ) £ R(t),..., (X 2 ,xi) £ R(t), (xi,y) £ R(t)” o 
(x,y) £ R(t l ) <s=> (y,x) £ (R(t*)) -1 . 

A transducer t is transitive if R(t) is transitive, that is, R(t 2 ) C R(t). 
For any regular language L, the relations R(t) PI (E* x L) and R(t) fl (L x E*) 
are regular. The details of a transducer realizing R(t) fl (E* x L), denoted by 
11 L, and of a transducer realizing R(t) n(IxE*), denoted by t 4.A, are shown 
in [11]; thus, 


u £ (t t L)(w) if and only if u £ t (w) and u £ L. 

Language operators. A language operator is a function Op : 2 s —> 2 s . 
If Op is any language operator, X is any language and i is any nonnegative 
integer, then we can define the following language operators. 

Op°(X) = X and Op i+1 (X) = Op(Op ?; (AT)) 


Op^(X) =IU Op(AT) U • • • U Op l (X) 


Op^(X) = Op* (AT) U Op i+1 (X) U • • • 


Op*(X) = U-oOp^X), Op+(X) = U^Op’pO 
Op n (x) = n- 1 Op i (x) 

If Opj^ is also a language operator then we write 

Op C Op x 

to indicate that Op(X) C Op 1 (X) for all languages X. 

We view a transducer t as a language operator, so the expressions t* and t n , 
for instance, are legitimate in this paper. With this convention we can say that 
a transducer is transitive if and only if 

t 2 C t. 

For transducer operators we also have that t(UjXj) = Ujt(Xj), for all language 
families (A,;),;. Using the above notation for language operators and Remark 1 
we have the following. 

Remark 2. If the transducer t is transitive then t + = t and t _1 is also transitive. 
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3 Codes and the max-min operator 

Here we provide background information on code-related properties (indepen¬ 
dence properties) and introduce the iterated max-min operator that is used to 
embed a given independent language to a maximal one. A property (over E) is 
any set V of languages. If L is in V then we say that L satisfies V. A code 
property, or independence, [10], is a property V for which there is n £ NU{Ko} 
such that 

L £ V, if and only if L' £ V, for all L' C L with 0 < |L'| < n, 

that is, L satisfies the property exactly when all nonempty subsets of L with 
less than n elements satisfy the property. In the rest of the paper we only 
consider properties V that are independences. A language L £ V is called 
"P-maximal, or a maximal V code, if L U w ^ V for any word w ^ L. From [10] 
we have that every L satisfying V is included in a maximal V code. To our 
knowledge, with possibly very few exceptions, all known code related properties 
in the literature [2,5,6,10,17,19,23,27] are code properties as defined above. 
In this work we focus on input-altering transducer properties. A transducer t 
is called input-altering if w ^ t(re), for all words w. A language L is called 
t- independent if 

t(L) n A = 0. (1) 

The independence Vt described by t is the set of all t-independent languages. 
It is easy to verify that the above equation is equivalent to 

t" 1 (L)ni = 0 (2) 

and also to 

(t _1 Vt)(L) fU = 0 (3) 

Thus, any of t, t , t V t _1 can be used to describe the same code property. 
Remark 3. Let t be an input-altering transducer. Every singleton language {«;} 
is t-independent. 

Remark 4. The approach of input-altering transducers constitutes a realization 
in algorithmic terms of independences defined via binary relations and includes 
many known properties such as prefix codes, bifix codes, outfix codes, and many 
error-detecting languages, as well as all the intersections of any two such prop¬ 
erties. In particular, for any binary relation p, a language L is p- independent 
if 

u,v £ L and (u , v) £ p implies u = v. (4) 

The above statement implies that p- independence is the same as p -1 indepen¬ 
dence. Let p^ = {(a :,y) £ p \ x ^ y}. If p^ is rational then there is an 
input-altering transducer t realizing it, and condition (4) is equivalent to any 
of (1)—(3) above. The representation of code properties by transducers (or 
other formal objects such as trajectories [5]) has lead to the implementation of 
a package for manipulating objects representing code properties [7], as well as 
to an online tool for answering questions about code properties [15]. 
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In the rest of the paper we consider a fixed, but arbitrary, input-altering 
transducer t, and a fixed, but arbitrary, language M. Let X be any language. 
We define the following language operators. 

I t (X) =M-(t(X)Ut~ 1 (X)) and /x t (X) = I t (X) - 

When the transducer t is understood, we omit above the subscript t. Also, as 
the operator /x is used heavily, we usually omit parentheses when applying /x on 
a language X. So the two operators are also written, respectively, as 

I(X) = M-(t(X)Ut- 1 (X)) and /jX =/j(X) = 1{X) - t~ x (I(X)) 

The above operators are essentially translated to our transducer notation from 
the corresponding ones in [24]. The operator I(-) is the set of all possible words 
that are either in X or t-independent from X, so in some sense it is the maximum 
set in which X can be embedded. However, two words in I(X) — X might be 
t-dependent. The operator mapping any Y to Y — t -1 (y) is the ‘t-minimize’ 
operator which returns all T-elements that cannot produce another T-element 
via t. The term ‘minimize’ makes sense in our context of input-decreasing 
transducers further below. 

Definition 5. The operator /xt, or simply p when t is understood, shown above 
is called the max-min operator. The operator fj,* is called the iterated max- 
min operator. We say that it converges finitely on a language L, if there 
is i £ No such that p.*L = p, l L. 

In the case of codes defined by length-increasing-and-transitive relations 
(equivalently, length-decreasing-and-transitive relations), already the language 
fiL is maximal and constitutes a solution to the embedding problem, where L 
is the given language satisfying the code property. As stated in [24], however, 
this does not work for other codes like bifix codes, and also for error-detecting 
codes. A main observation in this paper is that for any t-independent language 
L, the language /x*L is an embedding of L, provided that t satisfies a reasonable 
condition—see Section 5. 


4 Smooth Transducer Operators 

In this section we obtain several technical results about the max-min operator 
/x and we demonstrate Theorem 11, which states that when t is smooth and /x* 
converges finitely on some initial t-independent language L , then the resulting 
language is a t-independent maximal embedding of L. The concept of a smooth 
transducer is rather technical and is intended to keep the results general. All 
input-decreasing transducers of the next section are smooth. 

The second statement of the next lemma is the analogue of a statement in [5] 
concerning codes defined via trajectories. 

Lemma 6. Let X,Y be any languages and let L be a language satisfying the 
property Vt ■ The following statements hold true. 
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1. If A cy then 1(F) C 1(A). 

2. X — t _1 (A) satisfies Vt. 

3. fjIL satisfies Vt and L C /PL C /x* +1 L C 1(A), for all jgN. 


Proof. The first statement follows from the definition of I. For the second 
statement, we need to show that Eq. (1) holds for L = X — t _1 (A). For the sake 
of contradiction assume that there is w £ X — t _1 (A) and w £ t(A — t _1 (A)). 
Then w £ t (u) for some u £ X — t _1 (A^), which implies u ^ t -1 ^) and, then 
w (f t(it), which is impossible. 

For the third statement, we first show that for any language K satisfying 
Vt, we have 

K C fj.K C I(A") and /.iK satisfies Vt■ (5) 

The previous statement of the lemma implies that indeed /iK satisfies Vt- The 
definition of /r implies that f.iK C I(A'). Now, as K satisfies both Eq (1) and 
(2), we have that K f~l (t(A') Ut~ 1 2 3 * 5 (A')) = 0 and, therefore, K C I(A'). If it were 
the case that K nt _1 (I(A^)) ^ 0, then also t(A') nl(AT) ^ 0, which is impossible. 
Hence, K C fiK. Now the statement follows if we use L or fi l (L) in place of K 
in (5), taking also into account the first statement of the lemma. □ 

In going from the length-increasing-and-transitive binary relations of [24] 
to the input-altering ones of [6], we need to obtain a few somewhat subtle 
relationships between the operators fj,(-) and I(-). 

Definition 7. Let X be any language, and consider again our fixed input- 
altering transducer t. We define the following notation and concepts. 

1. ax.t = t _1 11(^0- When t is understood we simply write ax instead of 
ax, t- 

2. t is called exhaustive, if t n (A') = 0, for every language X. 

3. t is called smooth, if cr[J(I(A)) C a* x {n A), for every language A. 

One verifies that exhaustive t -1 implies smooth t. 

Lemma 8. Let X,Y be any languages. The following statements hold true. 

1. ax(X) = 0 = a~^(X) and ax(Y) C 1(A). Also, if X CY then 1(F) D 
t^ 1 {X) = 0 and try (A) C ax {A) for all languages A. 

2. fiX = 1(A) - cr*(1(A)) and 1(A) = /iX U ct y (I(A)). 

3. If X CY and i £ N then a\-(A) C a\(B) for all languages A,B with 
AC B. 

4- If t is transitive then also ax is transitive. 

5. 1(A) = a* x (jiX) U a2(I(A)). 
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6. If X = pX then 1(A) =XU 4 (1(A)). 

Proof. The first two statements follow from the definitions of the operators ax , 
I and p. For example, 

a x {X) =t- 1 (x)ni(x) =t~ 1 (X)n(M-t{X)-t- 1 {X)) = 0. 

The third statement follows from the second one using induction on i. The 
fourth statement follows when we note that t” 1 must be transitive (hence t” 2 C 
t _1 ) and, for all words u,v, u £ a' x ( v ) implies the existence of a word z such 
that u £ ax(z) and z £ ax{v), which implies u £ t~ 2 (v) and u £ I(A'). For 
the fifth statement, first note that 1(A) = pX U ax(pX) U <Ty(I(A)), which 
implies that, for all i £ No, 

I(X) = af(pX)Ua*+ 1 (I(X)). (6) 

If w £ a x {pX)\Ja^(l(X)), one uses Eq. (6) to show that w £ 1(A). Conversely, 
if w £ 1(A) and w crJO-PO)) then there is j £ N such that w £ a J x (l(X)) 
and, by Eq. (6) w £ a^~ 1 (pX). The last statement follows from the previous 
statements, when we note that 

I(X) = M AU4( M A)U4(I(X)). 


□ 

The first statement of the next lemma is needed in this section. The rest of 
the statements are used in the next section, but we include them here as they 
concern smooth transducers. The lemma uses the following notation, for i £ N 

A*A = p z X - /T" 1 A. (7) 

As before, when t is understood, it is omitted in the above notation. 

Lemma 9. Let igN, let X be any language and assume that the fixed trans¬ 
ducer t is smooth. Let L be any language satisfying Vt- The following statements 
hold true. 

1. 1(A) = a* x (pX) and pX D a x (pX) = 0. 

2 - f° r al1 j e 

5. l(p i L) C p i L U af_ 1L (A t L) C p i L U (r 1 )^ 2 (A i L) 
l \ +1 LCa^. lL (\L). 

5. a%(p^L)Caf_ 1L (p'L) 

6. af_ 1L (p'L) = af_ 1L (\L) C a^ 2i (A 1 L). 
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Proof. We use the previous lemma. In particular, as t is smooth, we have 


I(X) = a* x (pX). 


1. Now, pX = 1(A) - a x ( 1(A)) = <J* x {pX) - a+(pX) = pX - a+(pX). 
Thus, pX fl a x (pX) = 0. 

2. As a+_ li (/x i - 1 L) = 0, we have cr^-i L (p l L) = a IJ i-i L (A i L U p l ~ 1 L) = 
a r. i ~ 1 L(^iL) Ua^i-i L (p' l ~ 1 L) = a fJj i-i L (A i L). The statement now follows. 


3. Using the previous statement we have, 

l(^L) C l^~ l L) = = /TLU^-i^^O/L)) = 

p l L£\aT_ lL (A i L). Also, as l(p l L)nt~ 1 (p l L) = 0, we have that l(p l L) C 
p‘LUaf_ lL (A t L). 

4. As p l+1 L C l(p z L), the statement follows from the previous one. 

5 - V 2 z> l+li ) = Uj> 2 = 


U j>2(°U L (v i+lL ~ V iL ) V ^l^L)) = 

UU0) C 

Ui>2(^LKu-iL(^ i ))) = V-i L (^ L ) 

6. First note that, for all j £ N, 

%UlW L ) = C al( 


>2 


u J 1 L 




Then the statement follows when we use induction on i to show L {p l L) C 

°z 2i W. 

□ 


Lemma 10. Let L be a language satisfying the property Vt■ The following 
statements hold true. 

1. L is Vt-maximal if and only if I (L) C L. 

2. If L is Vt-maximal then L = p l L, for all i £ N. 

Proof. For the first statement, following [6] we have that L is "Pt-maximal if and 
only if, 

M-(LUt(L)Ut" 1 (L)) = 0, 

if and only if, I(L) — L = 0, if and only if, I(L) C L. 

For the second statement, let L be TA-maximal. Then I(L) C L, and the 
statement follows from Lemma 6. □ 
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Theorem 11. Let L be a language satisfying the property Vt- If t is smooth 
and there is i £ No such that p l+1 L = p l L then p l L is Vt-maximal and contains 

L. 

Proof. Assume p l+1 L = piL and let K = p l L. Then pK = K. As t is smooth, 
Lemmata 8 and 9 imply 

I (AT) = a* K {pK) = pK U a %(pK) = K U a^(K) = K. 

This implies that K is maximal using Lemma 10. □ 

The next theorem is a slightly stronger version of a result in [24] which states 
that L is included in the maximal pL when t is length-decreasing-and-transitive. 

Theorem 12. Let L be a language satisfying the property Vt- If t is transitive 
and smooth then pL is Vt-maximal and contains L. 

Proof. Assume t is transitive and smooth. Then, t 2 C t. By Lemma 10, it is 
sufficient to show that l(pL) C pL. First note that Lemmata 8 and 9 imply 
I (L) = pL U itl(pL). Then, 

l{pL) C I(L) = pL U er l(pL). 

By definition of I(-), we have I (pL) H t ~ l (pL) = 0, which implies that l(pL) C 
pL, as required. □ 

Example 13. The input-altering transducer ti in Fig. 1 is transitive and 
smooth but neither length-decreasing nor length-increasing. It is transitive be¬ 
cause of the fact that t 2 (x) = 0 for all x. This last fact implies that t\(x) = 0 
for all x and i > 2, which implies further that t]” 1 is exhaustive and, therefore, 
t is smooth. 

0/0 

— & 

Figure 1: An example of an input-altering transducer that is smooth 
and transitive, but neither length-decreasing nor length-increasing. 



5 Input-decreasing Transducer Properties 

In this section, we consider a fixed, but arbitrary, total order -< on the set 
£* of all words. Then, every word w has a position pos(w) with respect to 


10 




that order, starting from position 0. Moreover, v -< w implies v 7 ^ w, for any 
v,w G £*. We also consider a fixed, but arbitrary, transducer t such that 

y € t(x) implies y -< x 

for all words x, y. Any transducer satisfying the above condition is called an 

input-decreasing transducer. 

Definition 14. An input-decreasing transducer property is a property 
that is equal to Vt for some input-decreasing transducer t. 

Remark 15. Input-decreasing transducer properties are closed under intersec¬ 
tion, as (t V s) is input-decreasing when both t and s are. 

Remark 16. For any binary relation p , let 

P< = {(x,y) £p\x^y}, 

and assume that p ^ can be realized by an input-decreasing transducer t. One 
verifies that a language L is p-independent if and only if it satisfies Vt (that is, 
L is t-independent). 

Lemma 17. Consider the fixed input-decreasing transducer t. The following 
statements hold true. 

1. t is input-altering 

2. t{x) is finite, for all words x. 

3. t is smooth. 

Proof. The first two statements follow from the assumption that t is input- 
decreasing. For the last statement, we show that t _1 is exhaustive using contra¬ 
diction. So assume there is a language X and a word w such that w G (t _1 ) n (X). 
Let p be the position of w with respect to the total order -<. As w £ (t _1 ) p+1 (AT), 
there are words Xo,x\,... ,x p G X such that 

x\ G t - 1 (a;o), X 2 G t _ 1 (a;i),..., x p G t _ 1 (a;p_i), w G t _ 1 (cc p ). 

Then, xo A Xi -< • • ■ -< x p -< w, which implies that the position of w is greater 
than p , a contradiction. □ 

Theorem 18. Assume that t is input-decreasing. If a language L satisfies Vt 
then the language p*L is Vt-maximal and contains L. 

Proof. That p*L contains L follows from Lemma 6 . Also, using the same lemma 
it follows that no two words u,v in p*L are related via R(t) and, therefore, p,*L 
satisfies Vt- To show that p,*L is maximal we pick any word w G I(p,*L) and 
show that w G p*L —see Lemma 10. By the definition of the operator I, we 
have that 

w G p| (M - (t V t ~ l ){piL)) = p| I(//L). 

2>0 2>0 
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Then by Lemma 9, for every nonnegative integer i. we have that 

w e a*^ iL (n l+1 L) = (y l+1 L) U a^ L (y i+1 L) U a^ L (y z+1 L). 

Now let i = [pos(w)/2j. As w £ (M — (t V t~ 1 )(y‘ +1 L)), we have that w ^ 
^ L (^ +1 I). Also, by Lemma 9(6), any u G o~? l (h 1+1 L) must have pos(u) > 
2i + 2 and, therefore, w £ a^ L (y t+1 L). Hence, w G (y l+1 L) C y*L, as required. 

□ 


In the above theorem, the premise that t be input-decreasing is essential. 
This is shown next with examples. 

Example 19. Let p and s be the input-decreasing transducers describing, re¬ 
spectively, prefix codes and suffix codes. Let b = (pVs) be the input-decreasing 



Figure 2: The left transducer describes prefix codes: on input x 
it outputs any proper prefix of x. The right transducer describes 
suffix codes. Both transducers are input-decreasing. Note: in this 
and the following transducer figures, an arrow with label a/a rep¬ 
resents a set of edges with labels a/a for all a G E; and similarly 
for an arrow with label a/X. An arrow with label a/a' represents 
a set of edges with labels a/a! for all a, a! G E with a ^ a'. 


transducer describing bifix codes. Then, for any bifix code L , the language y^L 
is a maximal bifix code containing L. On the other hand, if we describe bifix 
codes using any of the three transducers 

(p -1 V s), (pVs" 1 ), (p- 1 Vs’ 1 ), 

then the theorem does not hold although all three are input-altering, none 
of them is input-decreasing. For example, with t being any of those three 
transducers, and for E = {0,1}, we have /x(001) = 001, hence /x*(001) = 001; 
that is on input 001, the iterated max-min operator returns 001 itself, which is 
not maximal bifix— here we have used FAdo [7] for computations on automata 
and transducers. 

Remark 20. If the language L is regular then also the language y l L is regular, 
for all i G N. This follows by the definition of y and the standard closure 
properties of regular languages. In particular, an automaton accepting y l L can 
be effectively computed from any automaton accepting L. Thus, if t is such 
that y* = y 1 for some index i. then a maximal regular embedding of any given 
regular L can be effectively computed. 
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Theorem 21. Assume that M is finite and t is input-decreasing. If a language 
L C M satisfies property Vt then there is i £ N such that the language p/L is 
regular, Vt-maximal and contains L. 

Proof. Let M = {wi,... ,w n } for some n £ N. As each t(wi) is finite there is 
Pi £ N such that t Pi {wi) = 0. Hence, t p (M) = 0, where p = ma Xi{pi}. Then we 
have that (t p ) _1 (M) = 0, and also (t _p )(M) = 0, by Remark 1. Then Lemma 9 
implies 

Ai !+rp/ 2 l (X) - ^rp /21 (£) c a^ 2[p/2] (pL) C ajf(pL) C 0. 

Hence, ph'/ 2 ! (L) is TA-maximal by Theorem 11. □ 

As before, the premise that t be input-decreasing is essential. This is shown 
next with an example. 

Example 22. Let subi be the input-altering transducer (shown below) de¬ 
scribing 1-substitution-detecting languages. A language L is fc-substitution 
detecting if no L-word can result into another L-word using up to k symbol 
substitutions (one substitution = one symbol replaced with another one). The 


a/a a/a 



Figure 3: This transducer describes 1 -substitution-detecting 
languages—see caption of the previous figure for explanations on 
transducer diagrams. It is input-altering but not input-decreasing. 


transducer is not input-decreasing, as 0 S subi(l) and 1 € subi(O). Moreover 
for t = sub^ we have that /i( 0000 ) = 0000 and, hence, /i*( 0000 ) = 0000 , which 
is not maximal 1 -substitution-detecting. 


6 Examples and further observations 

In this section we use the standard quasi-lexicographic (or radix) total order on 
all words over {0,1,.. ■ ,q — 1}, for some integer q > 2. Thus, u -< v means that, 
either u is shorter, or u and v are of the same length and, for the first position in 
which they differ, the symbol of u at that position is smaller than that of v. All 
the examples presented below have been confirmed using the well-maintained 
Python package FAdo [7], which was recently updated to include a module on 
codes described by input-altering transducers [ 12 ]. 

In our examples below we use notation of regular expressions. For instance, 
01 * 0(0 + 1 ) denotes the language { 01*0 | i G Nq}{ 0 , 1 }. 
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Example 23. Let M = {0, 1}* and t = b = the input-decreasing transducer 
describing bifix codes. We have that 

/i( 001 ) = { 001 , 000 , 10 , 11 }, and p 2 ( 001 ) = 01 * 0(0 + 1 ) + 10 + 11 

which is maximal. Again, we have 

g((0 + 1 ) 3 0 ) = (0 + l) 4 , which is maximal, 

/r 2 ((0 + 1) 3 H) = (0 + 1) 3 (0 + 10*1), which is maximal. 

The last code above is the reverse of a code in [2] , which is called there reversible 
Golomb-Rice code. Finally, note that p 5 on 11111 generates a maximal bifix 
code. 

For the next examples we use the two transducers shown below over the 
binary alphabet { 0 , 1 }. 


a/a a/a a/a a/a a/a 



Figure 4: On input x, the left transducer outputs any word re¬ 
sulting by substituting exactly one 1 in x with a 0. Note that 
(sub) 4 ) V (sub) 4 ) -1 is equal to the transducer subi and, there¬ 
fore, sub) 4 describes the 1 -substitution-detecting languages over 
{0,1}. The right transducer describes the 2-substitution-detecting 
languages. Both transducers are input-decreasing. 


Example 24. Let M = {0, l } 5 and t = sub) 4 = the input-decreasing trans¬ 
ducer describing 1-substitution-detecting languages. We have that 

p 3 ( 01111 ) = {w £ { 0 , l } 5 | w’s count of Is is even} 

This code is maximal and known as the even-parity code of length 5, which 
constitutes a vector space of dimension 4 consisting of 2 4 codewords. 

Example 25. Let M = {0, l } 7 and t = sub.) 4 = the input-decreasing trans¬ 
ducer describing 2-substitution-detecting languages. We have that 

p 6 (lllllll)={0000000,1001011,0101010,1100001,0011001,1010010, 
0110011,1111000,0000111,1001100,0101101,1100110, 
0011110,1010101,0110100,1111111} 

This code is the reverse of the Hamming code of length 7 [ 8 ]. It is maximal and 
constitutes a vector space of dimension 4 consisting of 2 4 codewords. It is also 
1 -substitution- correcting. 
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In the next example we use the input-decreasing transducer id^ shown in 
Fig 5, which describes 2-insertion-deletion-detecting languages. A language L 
is fc-insertion-deletion detecting if no L-word can result into another L- 
word using a total of up to k symbol insertions/deletions. The challenge in 
designing the transducer is to make sure that two insertion-deletion errors on 
some input word x do cause the resulting word to be different from x and smaller 
than x. Moreover, the transducer (id?) V (id^) 1 is such that, on any input 
word x, one or two insertion/deletion errors are applied resulting into a word 
not equal to x. The main idea is that id^ applies an insertion and a deletion in 
two ways: (i) A deletion of 1 immediately followed by either a 0 not changed or 
an inserted 0. This is justified, as deleting a 1 in a run of Is has the same effect 
as deleting the last 1 of that run; (ii) An insertion of a 0 immediately followed 
by either a 1 not changed or a deleted 1. Again, this is justified as inserting a 
0 in a run of Os has the same effect as inserting the 0 at the end of that run. 


a/a a/a 



Figure 5: This is an input-decreasing transducer describing the 
2-insertion-deletion-detecting languages over {0,1}. 


Example 26. Let M = {0, l} 6 and t = id^ = the input-decreasing transducer 
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describing 2-insertion-deletion-detecting languages. We have that 

/U 5 (001011) = {000000,001011,001100,010001,011101, 

101010,110000, noon, liiioo, mm} 

This code is maximal and consists of 10 codewords. Any 2-insertion-deletion- 
detecting code of fixed length has a Levenshtein distance greater than 2 and, 
therefore, it is also 1-insertion-deletion- correcting. We note that the Leven¬ 
shtein 1-insertion-deletion-correcting code of length 6 in [16] is maximal and 
consists of 10 codewords as well. 

Example 27. Let M = {0, l}- 6 and t = pVsub^ = the input-decreasing trans¬ 
ducer describing languages over {0,1} that are both 1-substitution-detecting and 
prefix codes. We have that 

p 5 (lll) = {0,10, 111, 1100,11010,110110} 

is maximal (relative to {0, l}- 6 ). 

The next result shows an example of an input-decreasing transducer and 
language on which the p* does not converge finitely. First we establish the 
following lemma. 

Lemma 28. Let n G No, let E = {0,1}, let L n = { 1,00, 010,..., 01 n-1 0}, and 
let t = pVsub^ = the input-decreasing transducer describing the languages that 
are both 1-substitution-detecting and prefix codes. We have that 

I(L„) =L„U01 ri E+. 

Proof. Recall that sub^ substitutes exactly one 1 with a 0, and hence, (sub^) -1 
substitutes exactly one 0 with a 1. Thus, 

(sub ?)-\L n ) = (10 + 110 + • • ■ + 1"0) + (01 + 011 + ■ ■ • + 01 n ). 

We use the notation (a:) 1 / 0 to denote the set of all words that result by substi¬ 
tuting exactly one 1 with a 0 in the word x. Thus, 

sub ?{L n ) = 0 + 0(l) 1/o 0 + ■ ■ • + 0(l n_1 ) 1/0 0. 
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Using the definition of I(-) and the fact S* = A + 0 + l + 0E + + 1E + , we have 

I {L n ) = E* - p (L n ) - L n T, + - sub^ (L n ) - (sub^)™ 1 ^) = 

1 + 0E+ - 00E+-01” _1 0E + - 01 - 01 2 -01” 

- 000 - 0(ll) 1/o 0-0(l” _1 ) 1/o 0 = 

(1 + 00) + (01E+ - 010E+-01” _1 0E + - 01 2 -01” 

- 000 - 0(ll) 1/0 0-0(l”- 1 ) 1/o 0) = 

(1 + 00 + 010) + (011E+ - 01 2 0E+-01” _1 0E+ - 01 3 -01” 

- 000 - 0(lll) 1/o 0-o(i”- 1 ) 1/0 o) =.= 

(1 + 00 + 010 +••• + 01 ”“ 2 0 ) 

+ (01” _1 E+ - 01” _1 0E + - 01” - 0(l”" 1 ) 1/ °0) = 

L n + (01"E+ - 0(l”- 1 ) 1/ °0) = L n + 01”E+, 

as required. □ 

Theorem 29. Let M = E* and E = {0,1}. There is an input-decreasing 
transducer t such that (/i*l) does not converge finitely. 

Proof. We consider the notation in the above lemma, and we use induction on 
n € No to show that 


p n l = {1,00,010,..., 01” _1 0}. 

The statement holds for n = 0. Assume it holds for some n, as displayed above, 
and consider calculating (/x” + 1 l). Using the definition of I(-) we have that 

( M ” +1 1) = I{p n 1) - t-\l{p n l)) = I( M ”1) - - M ”l), 

where we have used the fact t _ 1 (/i"l) D p n l = 0 as /r”l satisfies V t -i- Now 
using the above lemma we have that 

p n+1 l= (ji n l) U 01”E+ — t _ 1 (01"E + ). 

Using the definition of t one verifies that t(/z”l) n01”E + = 0 and that the only 
element of 01"E + that does not belong to t“ 1 (01"E + ) is 01"0. Then it follows 
that (ii n+1 1 ) = ( p n l ) + 01 ” 0 , as required. □ 

Example 30. In [9] the authors consider computing a maximal prefix code 
that is a subset of a given regular language L. We can replace prefix code with 
t-independent language, for any suitable input-decreasing transducer t, and 
approach this generalized problem by first computing X = L — t - 1 (L), which 
is always t-independent, and then use the iterated max-min operator to embed 
X into a maximal t-independent language relative to M = L. The work in [9] 
provides further results on maximal prefix codes that can possibly be extended 
to certain t-independences—see also the last section for further comments. 
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7 Conclusion 


We have shown that when an independence property is described by an input- 
decreasing transducer t, then the max-min operator pt can be iterated on any 
language to produce a maximal embedding. This approach works for many 
natural independence properties from both the noiseless and noisy domains of 
coding theory, as well as for any combinations of such properties. We conclude 
with a few directions for future research. 

• Find out whether, for any given regular bifix code, the max-min operator 
converges finitely. We believe that the answer here is yes. 

• Find out whether any regular maximal t-independent language is the re¬ 
sult of applying the iterated max-min operator on some initial finite lan¬ 
guage. This might be true for some cases of t. Related results of this type 
exist in [13,14]. 

• Study the behaviour of p* on various finite languages, in particular on 
singleton languages {w}. In this setting, we can talk about the code 
generated by w. We note that many substitution-detecting codes (CRC 
codes in particular) are generated from a single word, which in fact is 
represented by a polynomial [17]. 

• Explore the quality of the maximal languages generated by p*. In terms 
of information theory, quality could be the average word length, or the 
efficiency of encoding information, for instance. In terms of automaton 
theory, investigate the state complexity of the regular maximal languages 
in terms of the state complexity of the initial language, for various cases 
of t. A study of this type for prefix codes can be found in [9]. 

• Find out whether the following problem is computable: given any input- 
altering transducer t, return (if possible) an input-decreasing transducer 
x such that t V t” 1 = x V x” 1 —that is, Vt = V x . 
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